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3 <110> APPLICANT: CAMPBELL , ANTHONY KEITH 

5 <120> TITLE OF INVENTION: PROTEIN AND DNA CODING THEREFOR 
7 <130> FILE REFERENCE: WCM.69.US 
9 <140> CURRENT APPLICATION NUMBER: 09/831, 142B 
10 <141> CURRENT FILING DATE: 2001-05-07 

12 <150> PRIOR APPLICATION NUMBER: PCT/GB99/03654 

13 <151> PRIOR FILING DATE: 1999-11-05 

15 <150> PRIOR APPLICATION NUMBER: GB 9824357.9 

16 <151> PRIOR FILING DATE: 1998-11-07 
18 <160> NUMBER OF SEQ ID NOS: 42 

20 <170> SOFTWARE: Patentln Ver. 2.1 

22 <210> SEQ ID NO: 1 

23 <211> LENGTH: 870 

24 <212> TYPE: DNA 

25 <213> ORGANISM: Pholas dactylus 

27 <220> FEATURE: 

28 <221> NAME/KEY: CDS 

29 <222> LOCATION: (30).. (704) 

31 <400> SEQUENCE: 1 

32 gaattcggca cgagtcggaa aagaacaaa atg get tgt ate gtt ttc gtt get 53 



33 


















Met Ala 


Cys 


He 


Val 


Phe 


Val 


Ala 




34 


















1 






5 










36 


ctt 


gtc 


get 


eta 


tgc 


tta 


atg 


caa 


ccg ggt 


tec 


ggt 


gag 


gaa 


gta 


caa 


101 


37 


Leu 


Val 


Ala 


Leu 


Cys 


Leu 


Met 


Gin 


Pro Gly 


Ser 


Gly 


Glu 


Glu 


Val 


Gin 




38 




10 










15 








20 












40 


tgc 


gcg 


atg 


aat 


tgg 


aca 


caa 


get 


aat gaa 


tat 


gtg 


ttc 


aac 


gtg 


gac 


149 


41 


Cys 


Ala 


Met 


Asn 


Trp 


Thr 


Gin 


Ala 


Asn Glu 


Tyr 


Val 


Phe 


Asn 


Val 


Asp 




42 


25 










30 








35 










40 




44 


tgg 


atg 


acc 


att 


ttc 


ate 


tac 


gac 


tat ggc 


get 


caa 


gag 


caa 


ctg 


tac 


197 


45 


Trp 


Met 


Thr 


He 


Phe 


He 


Tyr 


Asp 


Tyr Gly Ala 


Gin 


Glu 


Gin 


Leu 


Tyr 




46 










45 








50 










55 






48 


gaa 


gat 


egg 


get 


ttg 


ggg 


ctg 


tgt 


egg att 


gaa 


egg 


gee 


ggc 


cca 


ggt 


245 


49 


Glu 


Asp 


Arg 


Ala 


Leu 


Gly 


Leu 


Cys 


Arg He 


Glu 


Arg 


Ala 


Gly 


Pro 


Gly 




50 








60 










65 








70 








52 


acc 


aca 


aaa 


gee 


gtc 


tgg 


att 


aac 


tgg agt 


aac 


gac 


acg 


cag 


tea 


tgt 


293 


53 


Thr 


Thr 


Lys 


Ala 


Val 


Trp 


He 


Asn 


Trp Ser 


Asn 


Asp 


Thr 


Gin 


Ser 


Cys 




54 






75 










80 








85 










56 


gta 


aca 


aga 


aaa 


aca 


ate 


ttc 


ttc 


gag gtt 


ggt 


gga 


gaa 


att 


gee 


egg 


341 


57 


Val 


Thr 


Arg 


Lys 


Thr 


He 


Phe 


Phe Glu Val Gly 


Gly 


Glu 


He 


Ala 


Arg 




58 




90 










95 








100 












60 


eta 


gtt 


gac 


tac 


aga 


cca 


cag 


gaa 


gac gga 


act 


gag 


aaa 


act 


ttt 


aca 


389 


61 


Leu 


Val 


Asp 


Tyr 


Arg 


Pro 


Gin 


Glu 


Asp Gly 


Thr 


Glu 


Lys 


Thr 


Phe 


Thr 




62 


105 










110 








115 










120 
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fi A 




aaa 


ttc 


X.CX. 


age 


aaa 


atg 


cca 


ggc 


act 


tac 


atg 


ctt 


atg 


gac 


gtg 


437 




ft Y+rr 

Axg 


Lys 


Phe 




Cot- 
OCX 


Lys 


net 


Pro 


Lj±y 


rpU 

inr 


Tyr 


Met 


Leu 


Met 


Asp 


Val 




D D 




















loU 










13 5 






D O 


tgc 


get 


aca 


agg 


gsc 


get 


gat 


gat 


aaa 


tgc 


ate 


gaa 


ggc 


aca 


att 


gtg 


485 


fiQ 


o 


Ala 


Thr 


i\L y 


Asp 


Ala 

Ala 


Asp 


Asp 


Lys 


Cys 


i ie 


GlU 


Gly 


Thr 


lie 


Val 




70 
/ u 








1 AH 










14 D 










150 








7 ? 


rf "t" rr 


aca 


gtc 


agg 


gtg 


tec 


C La 


tat 


gac 


gaa 


gat 


aac 


aat 




gta 


atg 


C *"fc "1 

533 


7 ^ 


Vui 


Thr 


Val 


Arg 


VOl 


Oar 


Leu 


Tyr 


Asp 


Glu 


Asp 


Asn 


Asn 


Gly 


Val 


Met 




1 A 

I 4 






155 










160 










165 










1 6 


gat 


gaa 


ggt 


aag 


gtg 


att 


cca 


tct 


gag 


aca 


ate 


gag 


gat 


gat 


ate 


aag 


581 


77 


Asp 


Glu Gly 


Lys 


Val 


He 


Pro 


Ser 


Glu 


Thr 


He 


Glu 


Asp 


Asp 


He 


Lys 




78 




170 










175 










180 












80 


gac 


tgt 


ggg 


etc 


tta 


gac 


caa 


gat 


gtt 


gaa 


etc 


gat 


tat 


acg 


tgg 


act 


629 


81 


Asp 


Cys 


Gly 


Leu 


Leu 


Asp 


Gin 


Asp 


Val 


Glu 


Leu 


Asp 


Tyr 


Thr 


Trp 


Thr 




82 


185 










190 










195 










200 




84 


caa 


aac 


gag 


tgt 


gat 


eta 


cca 


gac 


aca 


gta 


gac 


gag 


get 


gaa 


gac 


aca 


677 


85 


Gin 


Asn 


Glu 


Cys 


Asp 


Leu 


Pro 


Asp 


Thr 


Val 


Asp 


Glu 


Ala 


Glu 


Asp 


Thr 




86 










205 










210 










215 






88 


ccg 


tea 


gaa 


act 


gga 


gaa 


ttc 


ttc 


tgg 


tagatctatc agactacttt 




724 


89 


Pro 


Ser 


Glu 


Thr 


Gly Glu 


Phe 


Phe 


Trp 


















90 








220 










225 



















92 tatcagcagg acaactggtc gttaccagac acctataacg tgtcctcatc aataatgtgt 784 
94 aaaacagaaa taatcgatag aatattgaaa ataaaatgtt aataaacact ggttgaaata 84 4 
96 tgaaaaaaaa aaaaaaaaaa ctcgag 870 

98 <210> SEQ ID NO: 2 

99 <211> LENGTH: 816 

100 <212> TYPE: DNA 

101 <213> ORGANISM: Pholas dactylus 

103 <400> SEQUENCE: 2 

104 gaatteggea cgagggaaaa gaacaaaatg gcttgtatcg ttttcgttgc tettgtcget 60 
10 5 etatgettaa tgcaaccggg ttccggtgag gaagtacaat gcgcgatgaa ttggacacaa 120 

106 gctaatgaat atgtgttcaa cgtggactgg atgaccattt tcatctacga etatggeget 180 

107 caagagcaac tgtacgagga tegggctttg gggctgtgtc ggattgaacg ggccggccca 240 

108 ggtaccacaa aagccgtctg gattaactgg agtaacgaca cgcagtcatg tgtaacaaga 300 

109 aaaacaatct tcttcgaggt tggtggagaa attgcccggc tagttgacta cagaccacag 360 

110 gaagacggaa ctgagaaaac ttttacaaga aaattctcta geaaaatgee aggcacttac 420 

111 atgcttatgg acgtgtgcgc tacaagggac gctgatgata aatgeatega aggcacaatt 4 80 

112 gtggtgacag tcagggtgtc cctatatgac gaagataaca atggtgtaat ggatgaaggt 540 

113 aaggttattc catctgagac aatcgaggat' gatatcaagg actgtgggct cttagaccaa 600 

114 gatgttgaac tcgattatac gtggactcaa aacgagtgtg atctaccaga cacagtagac 660 

115 gaggctgaag acacaccgtc agaaactgga gaattcttct ggtagatcta tcagaccact 720 

116 tttatcagca ggacaactgg tcgttaccag acacctataa cgtgtcctca tcaataatgt 780 

117 gtaaaacaga aataatcgat agaatattga aaataa 816 

120 <210> SEQ ID NO: 3 

121 <211> LENGTH: 852 

122 <212> TYPE: DNA 

123 <213> ORGANISM: Pholas dactylus 

125 <400> SEQUENCE: 3 

126 gtcggaaaag aacaaaatgg cttgtatcgt tttcgttgct cttgtcgctc tatgettaat 60 
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127 gcaaccgggt tccggtgagg aagtacaatg cgcgatgaat tggacacaag 

128 tgtgttcaac gtggactgga tgaccatttt catctacgac tatggcgctc 

129 gtacgaggat cgggctttgg ggctgtgtcg gattgaacgg gccggcccag 

130 agccgtctgg attaactgga gtaacgacac gcagtcatgt gtaacaagaa 

131 cttcgaggtt ggtggagaaa ttgcccggct agttgactac agaccacagg 

132 tgagaaaact tttacaagaa aattctctag caaaatgcca ggcacttaca 

133 cgtgtgcgct acaagggacg ctgatgataa atgcatcgaa ggcacaattg 

134 cagggtgtcc ctatatgacg aagataacaa tggtgtaatg gatgaaggta 

135 atctgagaca atcgaggatg atatcaagga ctgtgggctc ttagaccaag 

136 cgattatacg tggactcaaa acgagtgtga tctaccagac acagtagacg 

137 cacaccgtca gaaactggag aattcttctg gtagatctat cagaccactt 

138 gacaactggt cgttaccaga cacctataac gtgtcctcat caataatgtg 

139 ataatcgata gaatattgaa aataaaatgt taatagacac tggttgaaaa 

140 aaaaaactcg ag 
<210> SEQ ID NO: 4 

LENGTH: 225 



143 
144 



ctaatgaata 120 
aagagcaact 180 
gtaccacaaa 240 
aaacaatctt 300 
aagacggaac 360 
tgcttatgga 420 
tggtgacagt 480 
aggttattcc 54 0 
atgttgaact 600 
aggctgaaga 660 
ttatcagcag 720 
taaaacagaa 780 
aaaaaaaaaa 840 
852 



<211> 



145 <212> TYPE: PRT 

146 <213> ORGANISM: Pholas dactylus 
148 <400> SEQUENCE : 4 

14 9 Met Ala Cys He Val Phe Val Ala Leu Val Ala 
150 1 5 10 

152 Pro Gly Ser Gly Glu Glu Val Gin Cys Ala Met 

153 20 25 

155 Asn Glu Tyr Val Phe Asn Val Asp Trp Met Thr 

156 35 40 

158 Tyr Gly Ala Gin Glu Gin Leu Tyr Glu Asp Arg 

159 50 55 

161 Arg He Glu Arg Ala Gly Pro Gly Thr Thr Lys 

162 65 70 75 
164 Trp Ser Asn Asp Thr Gin Ser Cys Val Thr Arg 



165 85 90 

167 Glu Val Gly Gly Glu He Ala Arg Leu Val Asp 

168 100 105 

170 Asp Gly Thr Glu Lys Thr Phe Thr Arg Lys Phe 

171 115 120 

173 Gly Thr Tyr Met Leu Met Asp Val Cys Ala Thr 

174 130 135 

176 Lys Cys He Glu Gly Thr He Val Val Thr Val 

177 145 150 155 

179 Asp Glu Asp Asn Asn Gly Val Met Asp Glu Gly 

180 165 170 

182 Glu Thr He Glu Asp Asp He Lys Asp Cys Gly 

183 180 185 

185 Val Glu Leu Asp Tyr Thr Trp Thr Gin Asn Glu 

186 195 200 

188 Thr Val Asp Glu Ala Glu Asp Thr Pro Ser Glu 

189 210 215 

191 Trp 

192 225 



Leu Cys Leu 

Asn Trp Thr 
30 

He Phe He 
45 

Ala Leu Gly 
60 

Ala Val Trp 

Lys Thr lie 

Tyr Arg Pro 
110 

Ser Ser Lys 

125 
Arg Asp Ala 
140 

Arg Val Ser 

Lys Val lie 

Leu Leu Asp 
190 

Cys Asp Leu 

205 
Thr Gly Glu 
220 



Met Gin 
15 

Gin Ala 
Tyr Asp 
Leu Cys 

lie Asn 

80 

Phe Phe 
95 

Gin Glu 

Met Pro 

Asp Asp 

Leu Tyr 
160 
Pro Ser 
175 

Gin Asp 
Pro Asp 
Phe Phe 
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195 


<210> SEQ ID NO 


: 5 
























196 


<211> LENGTH: 205 
























197 


<212> TYPE: 


PRT 


























198 


<213> ORGANISM: 


Pholas dactylus 


















200 


<400> SEQUENCE : 


5 
























201 


Glu 


Glu 


Val 


Gin 


Cys 


Ala 


Met 


Asn 


Trp 


Thr 


Gin 


Ala 


Asn 


.Glu 


Tyr 


Val 


202 


1 








5 










10 










15 




204 


Phe 


Asn 


Val 


Asp 


Trp 


Met 


Thr 


He 


Phe 


He 


Tyr 


Asp 


Tyr 


Gly 


Ala 


Gin 


205 








20 










25 










30 






207 


Glu 


Gin 


Leu 


Tyr 


Glu 


Asp 


Arg 


Ala 


Leu 


Gly 


Leu 


Cys 


Arg 


He 


Glu 


Arg 


208 






35 










40 










45 








210 


Ala 


Gly 


Pro 


Gly 


Thr 


Thr 


Lys 


Ala 


Val 


Trp 


He 


Asn 


Trp 


Ser 


Asn 


Asp 


211 




50 










55 










60 










213 


Thr 


Gin 


Ser 


Cys 


Val 


Thr 


Arg 


Lys 


Thr 


He 


Phe 


Phe 


Glu 


Val 


Gly 


Gly 


214 


65 










70 










75 










80 


216 


Glu 


lie 


Ala 


Arg 


Leu 


Val 


Asp 


Tyr 


Arg 


Pro 


Gin 


Glu 


Asp 


Gly 


Thr 


Glu 


217 










85 










90 










95 




219 


Lys 


Thr 


Phe 


Thr 


Arg 


Lys 


Phe 


Ser 


Ser 


Lys 


Met 


Pro 


Gly 


Thr 


Tyr 


Met 


220 








100 










105 










110 






222 


Leu 


Met 


Asp 


Val 


Cys 


Ala 


Thr 


Arg 


Asp 


Ala 


Asp 


Asp 


Lys 


Cys 


He 


Glu 


223 






115 










120 










125 








225 


Gly 


Thr 


He 


Val 


Val 


Thr 


Val 


Arg 


Val 


Ser 


Leu 


Tyr 


Asp 


Glu 


Asp 


Asn 


226 




130 










135 










140 










228 


Asn 


Gly 


Val 


Met 


Asp 


GlU 


Gly 


Lys 


val 


He 


Pro 


Ser 


Glu 


Thr 


He 


Glu 


229 


145 










150 










155 










160 


231 


Asp 


Asp 


He 


Lys 


Asp 


Cys 


Gly 


Leu 


Leu 


Asp 


Gin 


Asp 


Val 


Glu 


Leu 


Asp 


232 










165 










170 










175 




234 


Tyr 


Thr 


Trp 


Thr 


Gin 


Asn 


Glu 


Cys 


Asp 


Leu 


Pro 


Asp 


Thr 


Val 


Asp 


Glu 


235 








180 










185 










190 






237 


Ala 


Glu 


Asp 


Thr 


Pro 


Ser 


Glu 


Thr 


Gly 


Glu 


Phe 


Phe 


Trp 








238 






195 










200 










205 








241 


<210> SEQ ID NO: 


: 6 
























242 


<211> LENGTH: 225 
























243 


<212> TYPE: 


PRT 


























244 


<213> ORGANISM: 


Pholas dactylus 


















246 


<400> SEQUENCE: 


6 
























247 


Met 


Ala 


Cys 


He 


Val 


Phe 


Val 


Ala 


Leu 


Val 


Ala 


Leu 


Cys 


Leu 


Met 


Gin 


248 


1 








5 










10 










15 




250 


Pro 


Gly 


Ser 


Gly 


Glu 


Glu 


Val 


Gin 


Cys 


Ala 


Met 


Asn 


Trp 


Thr 


Gin 


Ala 


251 








20 










25 










30 






253 


Asn 


Glu 


Tyr 


Val 


Phe 


Asn 


Val 


Asp 


Trp 


Met 


Thr 


He 


Phe 


He 


Tyr 


Asp 


254 






35 










40 










45 








256 


Tyr 


Gly 


Ala 


Gin 


Glu 


Gin 


Leu 


Tyr 


Glu 


Asp 


Arg 


Ala 


Leu 


Gly 


Leu 


Cys 


257 




50 










55 










60 










259 


Arg 


lie 


Glu 


Arg 


Ala 


Gly 


Pro 


Gly 


Thr 


Thr 


Lys 


Ala 


Val 


Trp 


He 


Asn 


260 


65 










70 










75 










80 


262 


Trp 


Ser 


Asn 


Asp 


Thr 


Gin 


Ser 


Cys 


Val 


Thr 


Arg 


Lys 


Thr 


He 


Phe 


Phe 


263 










85 










90 










95 




265 


Glu 


Val 


Gly 


Gly 


Glu 


He 


Ala 


Arg 


Leu 


Val 


Asp 


Tyr 


Arg 


Pro 


Gin 


Glu 
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266 








100 










105 










110 






268 


Asp 


Gly 


Thr 


Glu 


Lys 


Thr 


Phe 


Thr 


Arg 


Lys 


Phe 


Ser 


Ser 


Lys 


Met 


Pro 


269 






115 










120 










125 








271 


Gly 


Thr 


Tyr 


Met 


Leu 


Met 


Asp 


Val 


Cys 


Ala 


Thr 


Arg 


ASp 


Ala 


Asp 


Asp 


272 




130 










135 










140 










274 


Lys 


Cys 


lie 


Glu 


Gly 


Thr 


lie 


Val 


Val 


Thr 


Val 


Arg 


Val 


Ser 


Leu 


Tyr 


275 


14 5 










150 










155 










160 


277 


Asp 


Glu 


Asp 


Asn 


Asn 


Gly 


Val 


Met 


Asp 


Glu Gly Lys 


Val 


He 


Pro 


Ser 


278 










165 










170 










175 




280 


Glu 


Thr 


lie 


Glu 


Asp 


Asp 


He 


Lys 


Asp 


Cys 


Gly 


Leu 


Leu 


ASp 


Gin 


Asp 


281 








180 










185 










190 






283 


Val 


Glu 


Leu 


Asp 


Tyr 


Thr 


Trp 


Thr 


Gin 


Asn 


Glu 


Cys 


Asp 


Leu 


Pro 


Asp 


284 






195 










200 










205 








286 


Thr 


Val 


Asp 


Glu 


Ala 


Glu 


Asp 


Thr 


Pro 


Ser 


Glu 


Thr 


Gly 


Glu 


Phe 


Phe 


287 




210 










215 










220 











289 Trp 

290 225 

293 <210> SEQ ID NO : 7 

294 <211> LENGTH: 17 

295 <212> TYPE: DNA 

296 <213> ORGANISM: Artificial sequence 

298 <220> FEATURE: 

299 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

300 oligonucleotide 

302 <220> FEATURE: 

303 <221> NAME/KEY: modif ied_base 

304 <222> LOCATION: (3) 

305 <223> OTHER INFORMATION : i 
307 <400> SEQUENCE: 7 

W--> 308 acnathttyt tycargt 17 

311 <210> SEQ ID NO: 8 

312 <211> LENGTH: 17 

313 <212> TYPE: DNA 

314 <213> ORGANISM: Artificial sequence 

316 <220> FEATURE: 

317 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

318 oligonucleotide 

320 <220> FEATURE: 

321 <221> NAME/KEY: modif ied_base 

322 <222> LOCATION: (12) 

323 <223> OTHER INFORMATION: A, T, C or G 

325 <220> FEATURE : 

326 <221> NAME/ KEY : modif ied_base 

327 <222> LOCATION: (15) 

328 <223> OTHER INFORMATION: i 
330 <400> SEQUENCE: 8 

W--> 3 31 cargargarg gnacnga 17 

334 <210> SEQ ID NO: 9 

335 <211> LENGTH: 17 



Use of n and / or Xaa has been detected in the 
Sequence Listing. Review the Sequence Listing 
^_^=b to ensure a corresponding explanation is present 
* J in the <220> to <223> fields of each sequence 
using n or Xaa. 
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L 


308 


M 


341 


W 


(46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


:7 


L 


331 


M 


341 


W 


(46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


8 


L 


354 


M 


341 


W 


(46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


9 


L 


372 


M 


341 


W 


(46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


10 


L: 


557 


M: 


341 


W 


(46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


23 
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